Quantum Theory Contains Maximally Incompatible Observables 
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We show that Quantum Theory contains observables that are as incompatible as any probabilistic 
physical theory can have. We define the joint measurability region of a probabilistic theory which 
describes the amount of added noise needed to make any pair of observables in the theory jointly 
measurable. We then show that the joint measurability region of Quantum Theory is contained in 
the joint measurability region of any other probabilistic theory. 
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Quantum Theory has a number of important features 
not known in classical physics, ranging from the super- 
position and indeterminacy principles formulated by the 
pioneers to the more recently discovered no-cloning and 
no-broadcasting theorems. It is an old problem to iden- 
tify operationally significant properties of Quantum The- 
ory that distinguish it from other probabilistic theories. 
In recent years many features have been under intensive 
investigation from this perspective, including information 
processing [IJ, optimal state discrimination [2], entropy 
[3], purification j4] and discord [5]. It has been found 
that some properties are quite generally valid in any non- 
classical (no-signaling) probabilistic theories while others 
are specifically quantum. 

The existence of pairs of incompatible observables 
marks one of the most striking distinctions between 
Quantum Theory and classical physical theories. There 
are many manifestations of incompatibility, perhaps the 
most famous being the Heisenberg Uncertainty Principle 
6 . In this Letter we demonstrate that Quantum Theory 
contains observables that are as incompatible (in a sense 
to be defined) as observables in any probabilistic theory 
can be. 

Our aim is thus to compare Quantum Theory to other 
possible probabilistic physical theories. We first need 
to set some minimal constraints. A probabilistic theory 
is a framework that provides a description of physical 
systems in terms of states and observables with the 
following general properties: 

(i) The states of a system are represented by the 
elements of a convex subset of a real vector space. 

(ii) An observable is represented as an affine mapping 
from the set of states into the set of probability distribu- 
tions on some outcome space. For simplicity, we restrict 
ourselves here to observables with a finite or countable 
number of outcomes. 
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(iii) Any affine mapping from the set of states into the 
set of probability distributions is a valid observable. 

We consider a particular probabilistic theory (PT) as 
given by a family of convex sets of states with associated 
sets of observables that share some properties specific to 
that PT. One may think of each pair consisting of a set of 
states with associated set of observables as an instance of 
a PT representing a particular type of physical system. 

Given a PT, we denote by Wl(j\g) the probability of 
obtaining a measurement outcome j when an observable 
£DT is measured in a state g. Hence, < DJl(j\g) < 1 and 
J2j 9H(j|£>) = 1- We will typically label the measurement 
outcomes by integers. 

In Quantum Theory the states are described by den- 
sity operators and observables correspond to POVMs [7J. 
Their duality is given by the trace formula (with g a den- 
sity operator and M a POVM) 

M(j\g) = ti [gM(j)} . (1) 

Another example of a probabilistic theory is a classi- 
cal theory, where the states are probability measures on 
a phase space H, and observables are traditionally rep- 
resented as functions m : fl — >• R; the associated affine 
maps from states g to probability distributions are then 
given by the formula 

m(j\g) = g({xem-\j)}). (2) 

Continuing our discussion on general probabilistic the- 
ories, we note that it follows from the required proper- 
ties (i)-(iii) that the set of observables is a convex set; a 
mixture of two observables is an observable. Physically 
mixing corresponds to an experiment where we switch 
between two measurement apparatuses with a random 
probability. We can directly write a mixture of two ob- 
servables with the same set of measurement outcomes. 
If the sets of measurement outcomes differ, we can still 
write a mixture by first adding enough outcomes and 
then embedding both sets into Z. 

Another consequence of the basic requirements is that 
every constant mapping g^p, where p is a fixed proba- 
bility distribution, is an observable and we call it a trivial 
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observable. A trivial observable *T corresponds to a dice 
rolling experiment, where we randomly pick the mea- 
surement outcome according to a given fixed probabil- 
ity distribution, without manipulating the state at all. 
In Quantum Theory, trivial observables are described by 
POVMs T such that each operator T(j) is a multiple of 
the identity operator, i.e., T(j) — tjl for some < t < 1 
with J2j tj = 1. 

The concept of joint measurement can be defined in 
any probabilistic theory. Two observables 3Jti and DJl 2 
are jointly measurable if there exists an observable DJl 
such that 

^ m(j, k\ g ) = m 1 (j\g),y] m(j, k\ e ) = m(k\ g ). 
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(3) 

In this case 9JI is called a joint observable of 97ti and 9712- 
If 9Jti and 9712 are not jointly measurable, then we say 
that they are incompatible. 

Any probabilistic theory contains jointly measurable 
pairs of observables. Namely, a trivial observable g 1— > p 
is jointly measurable with any other observable; we can 
write a joint observable 



Tt(j,k\ Q )=m 1 (j\g)p(k) 



(4) 



for the trivial observable and any other observable SUti . 
This simply corresponds to an experiment where we mea- 
sure 9Jti and simultaneously roll a dice. It is a well known 
fact that, in Quantum Theory, an observable which is 
jointly measurable with all other observables is necessar- 
ily a trivial observable. Indeed, any POVM element of 
such an observable commutes with all projections and 
must therefore be a scalar multiple of the identity (e.g. 
Theorem IV. 1.3.1]). 

The following simple observation is a key ingredient for 
our discussion. 

Proposition 1. LetVfti andd)l 2 be two observables and 
< A < 1. Then XM 1 + (1-X)1 1 and (1~X)M 2 +X1 2 are 
jointly measurable for any choice of trivial observables Ti 
and %2 ■ 

This proposition can be proved with the following con- 
struction. First, let p\ and p 2 be the probability distri- 
butions related to TL\ and C I 2 . We define an observable 
9Jt by formula 

m(j,k\g) = Xp 2 (k) m 1 (j\g) + (l-X) Pl (j) M 2 (k\g) . (5) 

For a fixed g, the right hand side is clearly a proba- 
bility distribution. Moreover, the right hand side is an 
affine mapping on g; therefore 5Ut is an observable. The 
marginal observables are 

J2m(j,k\g) = A9Jti (j I g) + (1 - X) Pl (j), 

k 

»t0-, k\g) - (1 - X)M 2 (k\g) + X P2 (k). 




FIG. 1: (Color online) The region J(M X , M y ) for two orthog- 
onal spin- 1 measurements is a quadrant of the unit disk. The 
region A (light) is a subset of J(9Jti , OT2) for any pair OTi , OT2 , 
while the surplus region (dark) depends on the specific pair 
under consideration. 



This proves Prop. [T] 

The physical idea behind this construction is the fol- 
lowing. In each measurement run we flip a coin and, 
depending on the result, we measure either SDTi or 9Jt 2 in 
the input state g. In this way we get a measurement out- 
come for either 9Jli or 9Jt 2 . In addition to this, we roll a 
dice and pretend that this is a measurement outcome for 
the other observable. In this way we get an outcome for 
both observables simultaneously. The overall observable 
is the one given in formula ([5]). 

For two observables 971 1 and $Jl 2 , we denote by 
J(3Ki,27t 2 ) the set of all points (A,/i) G [0,1] x [0,1] 
for which there exist trivial observables %\ , 1 2 such that 
AOJti + (1 — A)Ti and fi9Jl% + (1 — (x)%2 are jointly mea- 
surable, and we call J(!2ti, 5DT2) the joint measurability 
region of dJti and 9Jt 2 . The joint measurability region 
thus characterizes how much noise (in terms of trivial 
observables) we need to add to be able to have a joint re- 
alization of 50? 1 and 9Jt2- Clearly, DJli and 9Jl 2 are jointly 
measurable if and only if (1, 1) e J(9Jti, 9Jt 2 ). 

The joint measurability region J(9Jti, 9H2) is a convex 
region which can be plotted in the plane. This follows as 



a simple application of Prop. 2 of [9 
this lemma it also follows from Prop 



with the help of 
TJthat 



A = {(A, (i) g [0, 1] x [0, 1] : A + fx < 1} C J(9Jt l5 0Jl 2 ) . 

As an example, suppose that we are within Quantum 
Theory and 9Jti and DJl 2 correspond to spin-| measure- 
ments in two orthogonal directions, say x and y -axes. 
We then describe them with two POVMs M x and M 
where 



M a (±l) = i(I±0, M v (±l) = i(/±tr v ), (6) 

and o~ x , o~ y are the usual Pauli matrices in C 2 . It has been 
shown in |10j that for the uniformly distributed trivial 
observable ±1 i/ (hence describing an unbiased coin), 
the two observables XM X + (1 — A)^l and fiM y + (l — 
are jointly measurable if and only if A 2 + ff 2 < 1. By 
Prop. 3 of [9] this inequality is a necessary condition 
for the joint measurability of any pair AM^ + (1 — A)Ti 
and fJ-My + (1 — /J.)T 2 , where Ti,T2 are arbitrary trivial 
observables. Therefore, we conclude that 

J(M X , M y ) = {(A, fi) G [0, 1] x [0, 1] : A 2 + fi 2 < 1} . (7) 
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FIG. 2: (Color online) The region A (light) is a subset of 
the joint measurability region Jpt (colored) in any proba- 
bilistic theory. The larger the surplus region (dark) is, the 
more jointly measurable the theory globally is. If (a) and (b) 
would be joint measurability regions for two different prob- 
abilistic theories, then we would conclude that (b) is more 
incompatible than (a). 



Proof of Theorem^ We have earlier seen that A C Jpt, 
so we need to show that Jqt Q A. Let (A,/x) ^ A, i.e., 
A + (J, > 1. Fix e > such that A + ^ > 1 + e. We then 
choose d to be a positive integer satisfying 



Vd-l 



1 



< e. 



(8) 



(This can be done since the left hand side — > when 
d — > oo.) We will consider a quantum system that 
is described by a d-dimensional Hilbert space Hd- Let 
{fj}jZo be an orthonormal basis for Hd- We define an- 
other orthonormal basis {4>k\k=Q f° r ^-d by 



ip k = l/v^y^i 



(9) 



This region is depicted in Fig. [T] 

We are now ready to define the main concept of this 
Letter. A global joint measurability feature of a proba- 
bilistic theory PT is characterized by the intersection of 
all the sets J(!jrti, 9Jt 2 ) across all instances of PT, and we 
denote 

J PT ={(A, n) e [0, 1] x [0, 1] : (A, m) € J(£Wi,£K 2 ) 
for all pairs of observables CtTti and 9Jt 2 
in all instances of PT}. 

We call Jpt the joint measurability region of PT. We al- 
ways have A C Jpp, but Jpt can be larger than A. The 
larger the surplus region is, the more jointly measurable 
the theory is globally; see Fig. [2) If (A,/x) £ J PT , this 
means that there is a pair of observables 9Hi and SUI2 such 
that the mixtures A9Jti + (1 - A)Ti and piM 2 + (1 - fJ,)1 2 
are incompatible with any choice of trivial observables 
Ti and T 2 . 

Since Jpt can be defined in any probabilistic theory, 
we can compare the joint measurability regions of dif- 
ferent probabilistic theories. We obviously have Jpt = 
[0, 1] x [0, 1] in any probabilistic theory where all measure- 
ments are jointly measurable, such as the classical prob- 
ability theory. In the most incompatible case we would 
have Jpt = A. We will next show that Quantum Theory 
is, globally, as incompatible as a probabilistic theory can 
be. 

Theorem 1. In Quantum Theory Jqt = A. In partic- 
ular, Jqt C Jpt for any probabilistic theory PT. 

In Quantum Theory every observable 9JI corresponds 
to a unique POVM M by equation ([!]). We will prove 
that for any pair (A, pi) £ A, there are quantum observ- 
ables Mi and M 2 such that the mixtures XM 1 + (1 — A)^ 
and fj,M 2 + (1 — m)T2 are incompatible with any choice 
of trivial observables Ti, T 2 . Our proof is based on a re- 
cent result [11] on the joint measurability region for two 
complementary observables, which is a generalization of 
the result illustrated in Fig. [T] 



The orthonormal bases {<Pj}j = o and {i^k}t=o 



ally unbiased, i.e., \(ifj \i/Jk)\ 
POVMs Mi and M 2 by 

MiC?) = \<Pj)(<Pj\, 



constant. 



are mutu- 
We define two 



(10) 



We thus obtain a pair of c?-outcome observables on 
Hd- Since Mi and M 2 consist of projections and 
Mi(j)M 2 (fc) ^ M 2 (fc)Mi(j), it follows that they are in- 
compatible. 

As proved in [11 , the observables A'Mi + (1 - A')Ti 
and /i'M 2 + (1 — /i')T 2 are incompatible for any choice of 
trivial observables Ti , T 2 whenever 



Since 



A' + // > 1 



X + fi>l + e>l 



fd-\ 



Vd-1 
d-l 



(11) 



(12) 



□ 



we conclude that (A, fx) £ Jqt- 

In the proof of Theorem [T] we have used quantum ob- 
servables with arbitrarily many (but a finite number of) 
outcomes. It is natural to ask if this is a necessary trick 
in order to reach the conclusion Jqt = A. To present 
a partial answer to this question, let us investigate the 
joint measurability region in the case of pairs of binary 
quantum observables. Our aim is to show that 

{(A, n) e [0, 1] x [0, 1] : A 2 + f < 1} C J(Mi, M 2 ) 

for any binary observables Mi and M 2 , regardless of the 
dimension of the Hilbert space. In other words, we will 
show that two orthogonal spin observables are as incom- 
patible as any binary observables can be. 

To this end, let us note that two binary quantum ob- 
servables are incompatible if and only if they enable a vi- 
olation of the Bell-CHSH inequality [T^]. We must there- 
fore look at the Bell expression 



B= |(MiNi 



(MiN 5 



(M 2 Ni) 



(MolSL 
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FIG. 3: In (a) the grey area represents the possible values that 
a and fi can obtain by varying the observables and the state 
in the Bell expression B = \a + fi\. The solid lines represent 
the Tsirelson bound B = 2v2 and the dashed lines represent 
the bound B — 2. By considering only observables which 
are mixtures with the uniformly distributed trivial observable 
with fixed A and fi, the area becomes smaller as depicted in 
(b), and a suitable choice of weights makes the violation of 
the Bell-CHSH inequality impossible. 



Let us denote a = (MiNi) + (MiN 2 ) and fi = (M 2 Ni) - 
(M2N2). Using, e.g., [13j Theorem 1] one verifies that 
by varying the observables Mj and IMj and the bipartite 
state the numbers a and fi obtain all of the values from 
the disc a 2 + fi 2 < 4. If we now mix the observables 
Mj with the trivial observable T(±l) = \l with some 
weights A and /1 we see that the pair (a, fi) turns into 
(Aa, fi/3), thus changing the Bell expression from \a + fi\ 
to |Ao;+/i/3|. We must therefore determine those (A, fi) for 
which \Xa + (i/3\ < 2 for all (a, fi) satisfying a 2 + fi 2 < 4 
(see Fig. [3]). But the boundary curve for this region is 
obtained when the equations (a / A) 2 + (fi / fi) 2 = 4 and a+ 
fi = 2 have at most one common solution. By inserting 
fi = 2 — a into the first equation the problem reduces to 
determining when the discriminant is negative or zero, 
and one readily verifies that this is the case exactly when 
A 2 + /x 2 <l- 

In conclusion, given any pair of binary observables M 1 
and M 2 , and weights A and fi with A 2 + /i 2 < 1, the 
mixtures A Mi + (1 — A)T and ^M2 + (1 — /i)T can not 
be used to violate the Bell-CSHS inequality and must 
therefore be jointly measurable. We note that in the 
case [i = A the same result using a different technique 
has been obtained by Banik et al. [T4"] . 

We note that in the proof of Theorem [l] we have used 
quantum observables acting on an arbitrarily high (but 
finite) dimensional (but finite) Hilbert space. We can 
again ask if this is a necessary trick in order to reach 
the conclusion Jqt — A. Since two mutually unbiased 
bases are expected to be among the most incompatible 
observables in a fixed dimension d, our construction sug- 
gests that the conclusion Jqt = A can be reached only 
if one considers arbitrarily high dimensions. A proof of 
this fact is, however, lacking. 

We show finally that the conclusion Jqt = A can be 
reached by using a single pair of incompatible observ- 



ables if we consider an infinite dimensional system and 
observables with infinite number of outcomes. 

Let H be an infinite dimensional Hilbert space and 
write it as a direct sum of finite <i-dimensional Hilbert 
spaces Hd, H = (Bd=2^ d - ^ n eacn %d consider a pair 
of mutually unbiased orthonormal bases {^}^Zq and 

{4>k}k=oi wnere the latter is obtained from the first one 
by the formula @. We define two POVMs Ni and N 2 



H t (d,3) = \<pf){<pf\ . Md,A) = |^X^I ■ (13) 
These observables act in the infinite dimensional Hilbert 



space T~L and d in (13) is an index labeling the different 



outcomes. The outcome space of Ni and N2 is fl c 
{{d,j):d = 2,3,...,j = 0,...,d-l}. 



Theorem 2. The observables Ni and N2 defined in (13) 
satisfy J(N X , N 2 ) = A. 

Proof of Theorem [1| Let p± and p 2 be two probability 
distributions defined on £loo. Assume that A + /i > 1 
and define two observables Ni a, N 2 ,^ via 



Ni,A(d,j) = \\<pj)(rf\ + (l-\) Pl (d,j)I, 
N a , M (d,fe) = m|^)<^| + (1-m)P2(<U)I. 



(14) 



We need to show that N^a and N2 lM are incompati- 
ble. To prove this, we make a counter assumption that 
Ni.a, N2. M are jointly measurable. This implies that for 
any projection P on H, the projected observables PNi^P 
and PN2 lM -P acting on a subspace PH are jointly mea- 
surable. (If G is a joint observable of two observables 
Mi, M 2 , then PGP is a joint observable of PMiP, PM 2 P 
in PH.) Especially, the projections of Ni.a and N2 )Al to 
any subspace JLd should be jointly measurable. But from 
the result cited in the proof of Theorem [l] we know that 
for d large enough, the projections to T-Ld are incompati- 
ble. Hence, Ni a and N2. M are incompatible. □ 



We note that the observables Ni and N2 defined in ( 13 ) 
are not the only pair that satisfy J(Ni,N2) = A. Namely, 
we can modify Ni and N 2 in any chosen subspace "Hd but 
the conclusion J(Ni, N2) = A is still true since it depends 
on the fact that Ni and N2 contain mutually unbiased 
bases in arbitrarily high dimension. An interesting ques- 
tion within Quantum Theory is to find a characterization 
of all pairs of quantum observables Mi,M2 that satisfy 
J(Mi, M 2 ) = A. Do they exist in finite dimensions? Can 
they have finite numbers of outcomes? 
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